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Abstract. We revisit the SeqBin constraint 1 1 1. This meta-constraint subsumes 
a number of important global constraints like CHANGE 0, SMOOTH (5| an d 
INCREASINGN Value (4). We show that the previously proposed filtering algo- 
rithm for SeqBin has two drawbacks even under strong restrictions: it does not 
detect bounds disentailment and it is not idempotent. We identify the cause for 
these problems, and propose a new propagator that overcomes both issues. Our 
algorithm is based on a connection to the problem of finding a path of a given 
cost in a restricted n-partite graph. Our propagator enforces domain consistency 
in Oind 2 ) and, for special cases of SeqBin that include CHANGE, SMOOTH and 
INCREASINGNVALUE in 0(nd) time. 

1 Introduction 

Global constraints are some of the jewels in the crown of constraint programming. 
They identify common structures such as permutations, and exploit powerful math- 
ematical concepts like matching theory, and computational techniques like flow algo- 
rithms to deliver strong pruning of the search space efficiently. Particularly eye-catching 
amongst these jewels are the meta-constraints: global constraints that combine together 
other constraints. For example, the CardPath meta-constraint |3| counts how many 
times a constraint holds down a sequence of variables. The SeqBin meta-constraint 
was recently introduced in [TJ to generalize several different global constraints used 
in time-tabling, scheduling, rostering and resource allocation. It also generalizes the 
CardPath constraint where the constraint being counted is binary. Our aim is to re- 
visit the SeqBin meta-constraint and give a new and efficient propagation algorithm. 

2 Background 

We write D(X) for the domain of possible values for X, lb(X) for the smallest value 
in D{X), ub(X) for the greatest. We will assume values range over to d. A constraint 
is domain consistent (DC) if and only if when a variable is assigned any of the values 
in its domain, there exist compatible values in the domains of all the other variables of 
the constraint. Such an assignment is called a support. A constraint is bound consistent 
(BC) if and only if when a variable is assigned the lower or upper bound in its domain, 
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there exist compatible values between the lower and upper bounds for all the other vari- 
ables. Such an assignment is called a bound support. A constraint is bounds disentailed 
when there exists no solution such that each variable takes value between its lower and 
upper bounds. A constraint is monotone if and only if there exists a total ordering -< of 
the domain values such that for any two values v, w if v -< w then v can be replaced 
by w in any support 0. We define tt = (^ hoUom ■— Q -<...-< d =: n top ). A binary 
constraint is row-convex if, in each row of the matrix representation of the constraint, 
all supported values are consecutive (i.e., no two values with support are separated by a 
value in the same row without support) j|6|. We use Xi.j to represent the variable-value 
pair Xi = j. Let C be a binary constraint. We write (j, k) G C if C allows the tuple 
(j, k). Consider a soft binary constraint C. We denote the cost of the tuple c(j, k). If 
(j, k) G C then c(j, k) = and c(j, k) = 1 otherwise. Given two sets of integers 
S and R, we denote S tfcl R = {s + r | s G S, r G R}. Given a constant c, we write 
S W c = {s + c | s € S}. We denote I[X] an instantiation of the variable sequence 
X = [Xi, . . . , X n \. 

3 The SeqBin constraint 

The SeqBin meta-constraint ensures that a binary constraint B holds down a sequence 
of variables, and counts how many times another binary constraint C is violated. 

Definition 1. Given an instantiation I[N, X\, . . . , X n ] and binary constraints B and 
C, the meta-constraint SEQBlN(iV, X, C, B) is satisfied if and only if for any i £ [1, n— 
1], (/[Xj], /[Xj+i]) G B holds, and I[N] is equal to the number of violations of the 
constraint C, (/[JQ], /[Xj+i]) ^ C, in I[X] plus 1. 

Note that we add 1 for consistency with the definition of SeqBin in (TJ. 

Example 1. Consider the SEQBlN(iV, [Xi, . . . , X 7 ],C, B) constraint where N = {3}, 
B is TRUE and C(JQ, Xj) is a monotone constraint with one satisfying tuple (1,1) G C, 
Dpfi) = #(^3) = ,0(^5) = D(X 7 ) = 1 and D(X 2 ) = D(X 4 ) = D(X 6 ) = 
{0, 1}. Consider an instantiation I[N = 3,Xi = 1,X 2 = 0,X 3 = 1, . . . , X 7 = 1]. 
The constraint C is violated twice: (X\ = 1, X2 = 0) and (X2 = 0, X3 = 1). Hence, 
the cost of the assignment is ^ = 2 + 1 = 3. □ 

A number of global constraints can be used to propagate SeqBin including 
Regular M7I8L cost Regular 0, CardPath and Slide Q. However, all are 
more expensive than the propagator proposed here. A thorough analysis of related work 
is presented in [1 1. We will assume that, as a preprocessing step, all binary constraints 
B are made DC which takes just 0(nd) time for monotone B. We say that an instanti- 
ation I[X] is S-coherent iff (I[Xi\, I[X i+1 ]) G B, i = 1, . . . , n. A value v G D(Xi) 
is ^-coherent iff there exists a B-coherent instantiation I[X] with I\Xj\ — v. 

3.1 A graph representation of S eqB in 

We present a connection between finding a solution of the S EQB IN constraint and the 
problem of finding a path of a given cost in a special n-partite graph where the cost of an 



edge is either or 1. We start with a description of the graph G(V, E). For each variable- 
value pair Xij we introduce a vertex in the graph that we label X{j, V — {xj t j\i — 
1, . . . , n, j € D(Xi)}. For each pair (Xij, Xi+i lV ) we introduce an edge iff the tuple 
(j,v) E B, hence, E = {(x iyj ,x i+ i tV )\i = 1, . . . , n - 1, j € D(Xi), v G D(X l+1 ) A 
(j, v) <G B}. An edge (j, v) is labeled with c(j, v). Note that vertices Xi.j, j € D(Xi), 
form the ith partition as they do not have edges between them. Moreover, there are edges 
only between neighbor partitions i and i + 1, i = 1, . . . ,n — 1. Hence, the resulting 
graph is a special type of n-partite graph that we call layered. To keep the presentation 
clear, we introduce dummy variables Xq and X n+ i with a single vertex 0*, and edges 
from £o.o* to a ^ vertices (values) of X\ with cost 1, and from all vertices of X n to 
Xn+i.o* with cost 1. To simplify notation, we label a vertex x\ j that is at the ith layer 
simply as j in all figures. We also use solid lines for edges of cost zero and dashed lines 
for edges of cost one. As variables correspond to layers in the graph we refer to layers 
and variables interchangeably. Similarly, as variable-value pairs correspond to vertices 
in the graph we refer to vertices at the ith layer and values in D(Xi) interchangeably. 
Given two values j at the ith layer and v at the (i + l)th layer we say that j/v is a 
support value for v/j iff there exists an edge (j, v) in the graph. 
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Fig. 1. A 9-partite graph that corresponds to the SeqBin constraint from Example [T] Dashed 
edges have cost one and solid edges have zero cost. 



Example 2. Consider the SEQBlN(A r , [X\ , . . . , X7] , C, B) constraint from Example[T] 
Figure [T] shows the corresponding graph representation of the constraint. □ 

We now describe an algorithm, PathDP to find a path of a given cost in a lay- 
ered graph. PathDP is a special case of the dynamic programming algorithm for 
the knapsack problem where all items have unit costs. Both the existing propagator 
for SeqBin and our new one are specializations of PathDP. Another specializa- 
tion of PathDP is the propagator for cost Regular (9). We denote by c(X) the 
set of all possible numbers of violations achieved by an assignment to X: c(X) = 
{k j I is B -coherent A c(I) = k} and similarly c(xij) = {k | / is B-coherent A 
I[Xi] — j A c(I) = k}. We denote the forward cost from the variable Xi to X n by 



Cf(x it j) = {k | I[Xi, . . . ,X n ] is B-coherent A I[Xi] = j A c(I) = k}. This set 
contains all the distinct costs that are achievable by paths from the vertex Xij to the 
vertex x n+ i,o*. We write IbAxij) — min(c/(xi J )) and ubt(xij) — max(c/(xjj)). 
Similarly, we denote the backward cost from the variable X\ to Xi by Cb(xij) = {k \ 
I[Xi, . . . , Xi] is incoherent A I[Xi] = j A c(I) = k}. It contains all the distinct costs 
that are achievable by paths from the vertex xo,o* to tne vertex Xij. We denote by 
lbb(x it j) = mm(c b (xij)) and ub b {x itj ) = max(c fc (o; JJ )). 



Algorithm 1 The pseudocode code for the PathDP algorithm 

1: procedure PathDP( G(V, E)) 

2: fori = n ->• 0; j £ D{Xi) do 

3: c f (xi,j] = 

4: forfc e D{X i+1 ),(j,k) G -Bdo 

5: Cf(x i]:j ) = Cf(xij) U ( C/ (a: i + i,fc) l+J c(j, k)) 

6: for i = l-+n + l;j'e -D(Xi) do 

7: Ci,(a:i,j) = 

8: farfc £ D(Xi-x), {k,j) £ Bdo 

9: c b (xij) = ci,^,,) U (ci,(xi_i, fc ) i+j c(k,j)) 

10: for i = -y n + 1: j g do 

11: c(x i; j) — Cf(xij) 1+1 Cb(a^i l3 -) 



> Compute the forward cost 



t> Compute the backward cost 



> Compute the total cost 



PathDP performs two scans of the layered graph, one from X n to X\ to compute 
forward costs, and one from X\ to X n to compute backward costs. The backward pass 
processes one layer at a time and computes the set Cf(xij) for each variable Xi and 
value j £ D{Xi) (lines [2]-[5]i. Dually, the forward pass computes for each variable 
and value j e the backward cost c (xj j)(lines[6j|9|. Finally, for each vertex the 

set of costs achievable on paths from xo.o* to a;„+i.o* that pass through Xij is Cf(xij) ] S 
Cb(xij). To match the semantics of SeqBin, we compute Cf(xi.j) l±J Cb(xij) W (—1) 
for each vertex. 

The time complexity for SeqBin using PathDP is 0(n 2 d 2 ) : the number of dis- 
tinct costs is at most n, so getting the union of two cost sets takes 0(n) time. Each 
vertex has at most d outgoing edges, so the set Cf(xij) can be computed in 0(nd) 
time for each X{ j. There are 0(nd) vertices in total, giving the stated complexity of 
0{n 2 d 2 ). 

Example 3. Consider the SEQBlN(iV, [X\ , . . . , Xf] , C, B) constraint from Example[T] 
Figure [T| shows the forward cost Cf(xi.j), the backward cost Cb(xij) and the total cost 
Cf(xij) tt) Cb(xij) 1+1 ( — 1), j E D(Xi), i = 0, . . . , 8 in gray rectangles. We have one 
rectangle for each variable-value pair Xj = j. Consider, for example, the vertex '1' at 
layer X$. We compute the forward cost Cf{x^^) = (c/(a;6,o) W c(l, 0)) U (c/(a;6,i) W 
c(l, 1)) = {3} U {1} = {1, 3} and the backward cost 0,(2:5,1) = (c 6 (x 4 ,o) W c(0, 1)) U 
(0(0:4,1) W c(l, 1)) = {3, 5} U {1, 3} = {1, 3, 5}. Then c/(x 5 ,i) W c 6 (x 5 ,i W (-1) = 
{1,3} W {1,3,5} W(-l) ={1,3,5,7}. □ 



Lemma 1. Lef G(y, E 1 ) a layered graph constructed from the SeqBin(7V, X, C, B) 
constraint as described above. There exists a bijection between B-coherent assignments 
I[X] of cost s and paths in the graph G{V, E) of cost s + 1. 



3.2 Revisiting SeqBin 

A domain consistency algorithm for the SeqBin(7V, X, C, B) constraint, SeqBinAlg 
was proposed in iffl under the restriction that B is a monotone constraint. In this section 
we identify two drawbacks of this algorithm that make it incomplete. We show that 
SeqBinAlg does not detect bounds disentailment and it is not idempotent even if B is 
a monotone constraint. It was observed idependently in [ 1 1 that SeqBinAlg does not 
enforce DC. However, the authors do not explicitly explain the source of the problems 
of SeqBinAlg and only identify a very restricted class of SeqBin instances where 
SeqBinAlg does enforce DC. 

We will identify the main reason that SeqBinAlg fails to enforce DC. This is 
important to develop a new algorithm that does enforce DC in 0(nd 2 ) time when 
B is monotone. SeqBinAlg uses Algorithm [T] to compute only the lower and up- 
per bounds of the forward and backward cost (Lemma 1 and 2 in HI). Namely, us- 
ing the notations in [1], we compute s(xi,j) = lb(cf(xij)), s(xij) = ub(cf(xij)), 
p(xij) = lb(cb(xij)) and p(xij) — ub(cb(xij)) in 0(nd 2 ). SeqBinAlg is based on 
these values and runs in 4 steps 0]: 

Phase 1 Remove all non incoherent values in D(X). 

Phase 2 For all values in D(X), compute s(xi j),s(xij),p(xi j) and p(xij). 
Phase 3 Adjust the min and max value of N with respect to s(X) and s(X). 
Phase 4 Using the result of Phase 3 and Proposition 4 [1 1, prune the remaining B- 
coherent values. 

The correctness of SeqBinAlg relies on Proposition 3. Unfortunately, this propo- 
sition is not correct, and the algorithm is consequently incomplete. 

Proposition 3 (in |1|). Given an instance of SeqBin(7V, X,C, B) with monotone 
B, SEQBm(N, X,C,B) has a solution iff \s(X),s(X)] n N ^ where s(X) = 
min je£ ,( Xl ) s(xij) ands(X) = max jeD ( Xl ) s(xij). 

Issue 1. Bounds disentailment. 

Lemma 2. The algorithm SeqBinAlg for SeqBin^, X, C, B) with monotone B 
does not detect bounds disentailment. 

Proof. Consider the SeqBin(A^, [Xi, . . . , X 7 ],C, true) constraint in Example[T] The 
constraint TRUE is monotone. Consider N = 4. Then s(X) = 1 and s(X) — 7. Hence, 
[1, 7] n {4} ^ 0. However, there is no solution with cost 4. The problem with the proof 
of Proposition 3 in [ 1 1 is the last sentence which claims that there is a solution for each 
value k G [s(xi ;K ), s(xi tV )] for some v. This is not true as Example [T| demonstrates. 
Note also that [s(xij) + p(xij),s(xij) +p(xij)] D {4} ^ 0. Hence, according to 
Proposition 4 HI each variable-value pair is DC which is also incorrect. □ 

Issue 2. Idempotency. As a consequence of not detecting bounds disentailment, 
SeqBinAlg is also not idempotent. 

Lemma 3. The filtering algorithm SeqBinAlg/ot" SEQBlN(A r , X, C, B) with mono- 
tone B is not idempotent. 
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Fig. 2. A 7-partite graph that corresponds to the SeqBin constraint from the proof of Lemma[3] 
Dashed edges have cost one and solid edges have cost zero, (a) shows initial costs; (b) shows 
costs after X 3 = is pruned. 



Proof. Consider the SEQBlN(iV, [X x , . . . , X 7 ], C, B) where = {3}, B = 
k)\j, k e [0, 3], (j, k) g (0, 0), (1, 0)}. C(Xi, X 3 ) is a monotone constraint with 
three satisfying tuples (2, 0), (0, 2), (0, 3) € C. Finally, D(Xi) = {0} 1 D(X 2 ) = 
{1,2},D(X 3 ) = D{X 4 ) = {0,2} and D(X 5 ) = {3}. Figure^a) shows the graph 
representation of the example. Note that 0(0:3,0) H N = {2} n {3} = 0. Hence, the 
value is pruned from D(X^). Therefore, the value X4 = 2 loses its support with cost 
2 (Figure^b)). The new cost of £4.2 is {4, 5} n N — and the value 2 is pruned from 
D (X4). Note that the removal of X4 — 2 triggers further propagation as Af 2 = 2 loses 
its support of cost 5, and 2 is removed from D(X^) at the next step. □ 

We note that if B is not monotone, SeqBinAlg may need 0(n) iterations to reach 
its fixpoint and Proposition 2 in HI only works if B is monotone. 

Remedy for SeqBinAlg. As seen in Lemmas |2j|3] the main cause of incomplete- 
ness in SeqBinAlg is that the set of costs for each vertex is a set rather than an 
interval even when B is monotone. One way to overcome this problem is to restrict 
SeqBin( N, X, C. B) to those instances where it is an interval. This approach was taken 
in 1 10 1 where SEQBlN(A r , X, C, B) was restricted to counting-continuous constraints. 

Definition 2. The constraint SeqBin(A^, X, C, B) is counting-continuous if and only 
if for any instantiation I[X] with k stretches in which C holds, for any variable Xj € X, 
changing the value of Xi in I[X] leads to k, k + 1, or k — 1 violations. 

This restriction ensures that the structure of the cost for each variable-value pair 
is an interval and, indeed, the filtering algorithm SeqBinAlg enforces DC. However, 
this approach has a number of drawbacks. First, restricting SeqBin(A^, X, C, B) to 
counting-continuous with monotone B excludes useful combinations of B and C. Ex- 
ample [I] shows that SEQBlN(Af, X, C is monotone, B is true) does not satisfy this 
property. Secondly, many practically interesting examples [ 1 ] that can be propagated 
in 0(nd) time do not satisfy these conditions. As was observed in iflOl . constraints 
Change^' 5 ^ = SeqBin(A^, X,C e {=, true) and Smooth are not counting- 
continuous. The INCREASINGNVALUE constraint which is SeqBin(A^, X, =, <) vio- 
lates the condition that B is monotone. The only remaining constraint that satisfies these 



restrictions on B and C is Change^'-* = SeqBin(7V, X, C e {<,<}, true). 
Unfortunately, the proof relies on the claim that C is monotone, which is false for 
C e {<, <}. Thirdly, we do not currently have a test to check if SeqBin(7V, X, C, B) 
is counting-continuous. Despite the problems pointed out above, the filtering algorithm 
SeqBinAlg enforces DC on IncreasingNValue and Change(C € {<, <}) in 
0(nd) as the counting-continuous property together with the row and column convex- 
ity of C are sufficient to achieve this complexity. 

In this work we take a different approach. We focus on an extension of the algorithm 
to handle non-interval cost sets. The challenge is to perform this extension in 0(nd 2 ) as 
the generic dynamic programming algorithm PathDP that handles sets natively runs 
in 0(n 2 d 2 ) time. Note that if the cost structure is an unrestricted set of values then 
the time complexity of PathDP is going to be hard to improve as it is a specialization 
of a well-studied dynamic programming algorithm for the knapsack problem where all 
items have unit cost. Hence, we show that the structure of the costs for a variable-value 
pair is restricted if B is monotone. This allows us to perform union operations on sets 
in O(l) time rather than 0(n). 

3.3 Cost structure 

We show that the structure of the cost for each variable-value pair is restricted. First, 
we introduce definitions to formalize the structure of forward and backward costs. 

Definition 3. A set S is a zipper set if it can be obtained from an interval [a,b] by 
removing all odd or all even values. We denote a zipper set as [a ~ b]. 
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Fig. 3. A 10-partite graph that corresponds to the SeqBin constraint from Examples[4j|6] Dashed 
edges have cost one and solid edges have zero cost. 



Note that in a zipper set [a ~ b], a and b have the same parity. If both are odd, 
[a ~ b] is an odd zipper set, while if both are even, [a ~ b] is an even zipper set. 

Definition 4. A set S is an i-zipper set if it can be written as [a ~ b] U [b, c] U [c ~ d], 
a < b < c < d. We denote an i-zipper set as [a ~ b — c ~ d]. If a = b, we write the set 
as [b — c ~ d] and if c — d we write it as [a ~ b — c]. 



Given an i-zipper set [a ~ b] U [b, c] U [c ~ d), we denote the left part [a ~ b] as 
I ■ zip, the middle part [b — c] as i ■ val and the right part [c ~ d] as r ■ zip. 



Example 4. Consider the SeqBin(N,[X 1i . . . ,X 8 ],C,B) constraint that Figure [3] 
presents. We only show the forward cost sets. For example, the forward cost set c t (s 3 5 ) 
is a zipper [1 ~ 5], Cf(x$,4) — [2 ~ 4] is an even zipper and 0^(2:3 5) = [1 ~ 5] is an 
odd zipper. An example of an i-zipper set is 0/(2:1,3) = [1^5 — 8]. □ 

Our filtering algorithm is based on the following theorem. 

Theorem 1. Consider a SeqBin(7V, X, C, B) constraint with monotone B and ar- 
bitrary C. Let [b,c], c > b be the maximal interval such that [6, c] C Cf(x iv ), 
i = 1, . . . , n, v G D(Xi). If such an interval does not exist we define [b, c] = 0. Then 
the following holds for any value j, k, {j, k} € D(Xi) and i = 1, . . . , n: 

1. Uniqueness. The set c/(x( u ) is either a zipper or i-zipper set. 

2. Overlapping. Ifcf(Xij) and Cf(xi t k) are i-zipper sets, Cf(xij) = [a ~ b — c ~ d] 
and Cf(xi t f.) = [s ~ r — q ~ t], then [b, c] n [r, q] 7^ 0. 

3. Structure. 

• Bounded holes. If Cf(xij) is an i-zipper set, [a ~ b — c ~ d] then b — a < 4 
and d — c < 4 

• Closeness. \lbf(xij) — lbf{x^k)\ < 2 and \ubf(xi,j) — vbf(xi t k)\ < 2. 

Theorem[T|shows that the structure of Cf(xij), j € D(Xi) is limited to few distinct 
structures of sets: a zipper and an i-zipper. This allows us to deal with such restricted 
sets efficiently. We give an overview of the proof. We identify two key properties of 
the problem. The first property is that for all but at most two layers the cost structure 
is homogeneous. All costs Cf(xij) are either zippers or i zippers. Moreover, layers that 
only contain zippers (i zippers) are consecutive. The layers [712, . . . ,n] only contain 
zippers for some n-z- The layers [1, . . . , nx\ only contain i zippers for some ri\ < n-x- 
There are at most two heterogeneous layers between these sequences. 

Example 5. Consider Figure [3] We only show the forward cost Cf(xij) for each 
variable-value pair in a gray rectangle. The homogeneous consecutive layers [na = 
4, . . . , n = 8] only contain zippers. The two heterogeneous consecutive layers [2, 3] 
contain zippers and i- zippers. The homogeneous consecutive layers [0, ri\ = 1] only 
contain i- zippers. □ 

The second property is that if we consider all cost sets at one layer then their 
lower(upper) bounds are at most distance two from each other. This is stated as the 
closeness property of the structure in Theorem[T] S ection [3~4| proves the first property 
and Section [33] proves the second property. The rest of the proof of Theorem [T] uses 
induction on the number of layers, taking these properties into account. These proofs 
are in the Appendices. Appendix C.l proves Theorem[T]for the sequence of layers that 
only contain cost sets that are zippers. Moreover, it imposes an additional property on 
the structure of zippers. Appendix C. 2 proves Theorem [T] for the two heterogeneous 
layers. This is the most tedious part of the proof using enumeration of all possible dis- 
tinct structures of the forward(backward) cost. This enumeration is feasible because of 
the properties of the cost structure in the first sequence. Appendix C.3 proves The- 
orem [T] for the last sequence that only contains i zippers. We show that no new cost 
structures may appear in this sequence. Overall, we prove that there are a bounded 
number of cost structures at each layer. 



3.4 Partitioning of layers 



The proof of Theorem [T] is based on the following lemma that partitions variables 
X\ , . . . , X n into three groups based on the structure of the forward costs (the back- 
ward costs are similar, but the partition may be different). 

Lemma 4. Consider a SeqBin(./V, X, C, B) with monotone B and arbitrary C. Let 
X t = j be the first variable in the reverse order of variables such that there exists a 
value j andan interval [a, b], a < bsuch that [a, b] C Cf(xtj), i.e., for all if G n], 
j € D{X t '), there does not exist [a', b'], a' < b', such that [a', b'] C Cf(xt'j). Then 
for all Cf(x a j), j G D(X a ), s£ [l,t — 2], there exists an interval [a s j, b s j] such that 
[a s ,j,b Stj ] C c f (x s j). 

Proof. Consider the pair of variables X t and X t _\. We recall that we consider variables 
in the reverse order form n to 0. Let v be the maximum value in the total order it such 
that v G D(X t -i). By the monotonicity of B and the fact that B(X t -\,X t ) is DC, 
we conclude that (v,j) G B. Otherwise, if (v,j) ^ B, the value j had to be pruned 
from D(Xi) by enforcing DC on B(X t -i, X t ) as v is the top value in the ordering in 
D(Xt-i). Therefore, there exists an interval [c, d] G {[a, b], [a + 1, b + 1]}, c < d such 
that [c,d] C c/(^i_i,„). 

Consider the pair of variables X t ~\ and X t _2- Due to monotonicity of B we know 
that (k,v) G B, q G D(Xt-2) as u is the top value in 7r such that u G D(X t -i). 
Hence, v is a support for all fc and Cf(xt-2,k) must contain an interval as Cf(xt-2,k) = 
U u , e _D(A' t _ 1 ) {cf (x t -i,w) W c(j,w)) and [c, d] C c/(a;t-i,t<), u G £>(X t _i). Hence, 
there exists an interval [c',(i'], c' < <i' such that [c',cf] C Cf(x t ~2,k) for all k G 
Z3(X t _2) including the top value in the ordering tt, fc', such that k' G D(X t -2)- We 
repeat the argument for layers s, s G [1, ...,< — 3]. □ 

Corollary 1. Consider a SeqBin(A^, X, C, B) with monotone B and arbitrary C. 
Then there are three blocks of consecutive variables [Xi,X ni ] U [X ni+ i, X n2 _{\ U 
[X n2 ,X n ] with n\ < n% < ri\ + 3, i.e., the size of the partition [X ni +i, X n2 —i] is at 
most 2, and: 

Zipper block. For all i,j, i G [/i2,n], j S D(Xi), there does not exist an interval 

[a, b] C [1, Tr top ], a < b, such that [a, b] C Cf(Xij). 
Zipper + i-Zipper block. There exist i,j, i G [«i + l,n 2 — 1], j G D(Xi) and an 

interval [a, 6] C [1, ir top ], a < b, such that [a, b] C Cf(xij). 
i-Zipper block. Foralli,j, i G j G -D(Xi) there exists an interval [a,b] C 

[1, 7r* op ], a < b, such that [a, 6] C c/(xij). 

Example 6. Consider Figure |5] The zipper block includes [Xi, . . . , Xg]. The zipper + 
i zipper block includes variables X 2 and X 3 . The i zipper block contains Xt. □ 

3.5 Closeness of costs 

We show that if B is a monotone constraint then the forward cost of the values of 
a variable cannot deviate too much from each other. Hence, we prove the closeness 
property of the cost structure in Theorem [T] 



Lemma 5. Consider a SeqBin(./V, X, C, B) with monotone B and arbitrary C. Con- 
sider a variable X; L , i = [1, . . . n]. Then for any two values j, k G D(Xi), j -< k, either 

ubf{x id ) e [ubf(x itk ),ubf(x itk ) + 1] orub f (x. hk ) G [ubf(x i j),ubf(x i j) +2]. 

Proof. By induction on the distance from n. The base case is trivial, as ubf(x n ^) = 
lbf(x n ,i) = 1 for all i. Suppose this holds for all X t +i, . . . , X n . We show that it holds 
for X t . Let v be a value such that ubf(xt.k) — ubf(xt+i, v ) + c{k, v) and wbea value 
such that ubf(xt,j) — ubf(xt+i, w ) + c(J, w). 

Property 1. If w = v or ubf(xt+i lV ) — ubf(xt+i, w ) then lemma holds. Proof: This 
follows from the assumption that all costs in C are zero or one. Hence, \ubf(xtj) — 
ubf(x tik )\ < 1. 

Property 2. The tuple (k, w) G B. Proof: This follows from monotonicity of B and the 
assumption that j -< k and from (j, w) G B. 

Property 3. If w -< v then (j, v) G B. Proof: This follows from monotonicity of B and 
the assumptions w -< v and (j, w) G B. 

Property 4. If (j, v) G B then w — v. Proof: In this case the bipartite 
subgraph over four vertices k,j,v,w is complete (Figure Qa)). Hence, v' = 
argmax w . v (ubf(xt+i,v),ubf(xt+i. w )) is a potential support for both ubf(xtj) and 
ubf(xt,k) and w and v coincide. 

From Properties 1-4 we know that we only have to prove Lemma in the following 
case: v -< w, v ^ w, ubf{x t +i. v ) ^ ubf(x t+ i tW ) and (j,v) £ B. 

By the induction hypothesis, there exist two cases: ubf(x t +i, v ) G 
[ubf(x t+ltW ),ub f (x t+liW ) + 1] or ubf(x t+1>w ) G [ub f (x t+1 , v ),ub f (x t+1 . v ) + 2}. 

Case 1. We assume ubf(x t+ i, v ) G [ubf(x t+ i tW ),ubf(x t +i, w ) + 1]. As 
ubf(x t +i,v) ^ ubf(x t+ i }W ) we know that ubf(x t +i, v ) = ubf (x t+ i, w ) + 1. We de- 
note p = ubf(xt+i }W )- Figure^b) shows this case. Note as costs of the edges are zero 
or one, ubf(x ty k) G {p + l,p + 2}. On the other hand, ubf(x t j) G {p 7 p + 1}. Hence, 
ubf(x t ,k) G [ubf(xtj),ubf(xtj) + 2] as required. 

Case 2. We assume ubf(xt+i, w ) & [ubf(%t+i,v)> ubf(%t+i,v) + 2] (Figure |4] (c)) 
and since ubf{x t+ i v ) ^ ubf(x t +i, w ), ubf(x t +i :W ) > ubf(x t+ i v ). As v -< w the 
value w is a support value for both j and fc. Hence, either v — w or ubf(x t+ i. v ) — 
ubf(xt+i, w ). This contradicts v ^ w and ubf{xt+x,v) ^ ubf(x t +i, w ). □ 



Xt x t+1 

k v^^> w, ub f (x t+1 ,„ ) 
j * — ■> v, ubf (x t+ i,» ) 

(a) 


X t X t+ i 

w - ub < ( x <+ 1 .» >= P 
j . v, ubf (x t+ i,v )= p+1 

(b) 


Xt X t +i 
k ■ » w, ub, (Xt + i,«, )= p+2 

j/N v, ubf (x t+ i,„ )= p 

w 


Xt X t+ i 

k^— ^w, lb, (x t+1 , w ) 
i v, lb, (x w ,„) 

(d) 


Xt Xt+i 

k v » w, Ibf (x,+i, w )= p+1 

j ^\ v, lbf(x, +1 , v )=p 

(o) 


Xt X t +i 

k v ^ w . Ib f ( x m,w )= P 

j v, lbf(x,+i, v )=p+2 



Fig. 4. Computation of the forward cost upper bound, ubf(xij), (a)-(c) and the forward cost 
lower bound, Ibf(xi.j), (d)-(f). Note that we do not distinguish between and 1 cost edges. 



Lemma 6. Consider a SeqBin(A^, X, C, B) with monotone B and arbitrary C. Then, 
either lb f(x itj ) € [lbf(x itk ), lbf(x ijk ) + 2] orlb f (x iyk ) <E [lb f (x iij ), lbf(x itj ) + I] for 
all variables Xi and values j -< /c. 

Proof. Analogous to Lemma [5] (Figure [4] (d)-(f)). □ 

We omit the rest of the proof here due to space limitation (see Appendix B-C ). We only 
mention Appendix C. 1, Lemma 15 that refines Theorem[T]for layers in the zipper block 
as we use this result in Section [4] Lemma 15 shows that at the ith layer in the zipper 
block, i G [m + 3, n], there are at most 4 possible distinct sets Cf(xij), j € D(Xi). 

3.6 Total cost 

Lemma 7. Consider a SEQBlN(A r , X, C, B) constraint with monotone B and arbi- 
trary C. The set c(xij) = Cf(xij) 1+1 ct(xi,j) 1+1 (—1), j G D(Xi), i — 1, . . . ,n is 
either a zipper or an i-zipper set. For any i-zipper set c(xij) — [a ~ b — c ~ d] it holds 
b — a < 4 and d — c < 4. Moreover, c(xi.j) can be computed in 0(1) time. 

Proof. It is sufficient to consider c(xi,j) — Cf(xij) 1+1 Cb(xij) as a shift by a constant 
does not change the structure of the set. As Cf(xi.j) and Cb(xij) satisfy TheoremJTJ 
they are either zipper or i-zipper sets. We consider 3 cases. 

Case 1. Both Cf(xij) and Cb(xij) are zipper sets. Consider zipper sets Cf(xi j) = 
[a ~ b] = {a, a + 2, . . . , b} and C) J (x i j) — [c ~ d] = {c, c+ 2, . . . , d}. Then c(xij) = 
{a + c, a + 2 + c, . . . , b + c, b + c+2, . . . , . . . b + d] = [(a + c) ~ (6 + d)]. 

Case 2. Both Cf(xi^) and ci,(xij), are i-zipper sets. Consider Cf(xij) = [a ~ 
b — r ~ and ct,(xij) = [c ~ rf — / ~ e]. We consider the most general case where 
a < b,r < q, c < d and / < e. 

We perform the operation l+l in three steps, c(xij) = c 1 U c 2 U c 3 where c 1 = 
c/^j) W [d,/] = [a + d,g + /], c 2 = c/(xi,j) W [c ~ d] = [(a + c) - (b + 
d)] U [6 + c,r + d] U [(r + c) - (6 + q)}, and c 3 = Cf(x itj ) l±l [/ ~ e] = [(a + 
/)-(& + e)] U [b + f, r + e] U [(r + /) - (e + g)]. As (6 + c) < (6 + d) and 
(r + c) < (r + d) we get c 2 = [(a + c) ~ (6 + c) — (r + d) ~ (& + g)]. Similarly, we 
get c 3 = [(a + /) ~ (6 + /) - (r + e) ~ (e + q)}. 

Finally, c{xi t j) — c 1 Uc 2 Uc 3 = [(a+c) ~ (min((6+c), (a+d))— max((r+e), (g+ 
/)) ~ (e + g)]. Consider the value min((6 + c), (a + d) — (a + c)). If b + c < a + d then 
we have (b + c) — (a + c) = 6 — a < 4. If a + d < b + c then we have (a + d) — (a + c) = 
d — c < 4. Similarly, we prove the result (e + q) — max((r + e), (g + /)) < 4. Hence, 
the statement of the lemma holds. 

Case 3. Exactly one of {cf(xij), Cb(xij)} is a zipper set. Similar to Case 2. 

Complexity. In all three cases above, the proof is constructive and we give an ana- 
lytic expression to compute c(xij). Hence, this can be done in 0(1) time. □ 

Example 7. Suppose Cf(xij) = [2^6 — 8^ 12] and Cb(xij) = [10 ~ 16 — 20 ~ 22]. 
Both Cf(xij) and Cb(xi j) are i-zipper sets. Hence, to compute c(xij) = (cf(xij) l+l 
c 6 (x iiJ -)W(-l)) we use the expression [(2 + 10) - (min((6+10), (2 + 16)) -max((8+ 
22), (12 + 20)) ~ (12 + 22)] W (-1) = [11 ~ 15-31 ~ 33]. □ 



4 Domain consistency algorithm 



In this section we present SeqBinAlgNew, a domain consistency algorithm for 
SeqBin(7V, X, C, B) with monotone B. It has the same structure as SeqBinAlG: 

Phase 1 Remove all non B -coherent values in the domains of X. 
Phase 2 For all values in the domains of X, compute Cf(xij) and Cb(xij). 
Phase 3 Prune the domain of N with respect to c/(xo,o* )■ 
Phase 4 Prune the remaining B-coherent values. 

The main complexity bottleneck is Phase 2 and Phase 4. If we do not put any restric- 
tions on B and C then it takes 0(n 2 d 2 ) in total to compute these sets. We show that the 
complexity of SeqBinAlgNew decreases as we put restrictions on constraints B and 
C. With respect to phase 3, we note that the cardinality of both D(N) and c/(xo,o*) is 
at most ri, so their intersection can be computed in time 0(n). 

4.1 Domain consistency algorithm in 0{ndP) with monotone B 

Phase 2 o/SeqBinAlgNew. We exploit the structure of the costs established by The- 
orem[T]to improve PathDP (Phase 2). We show that lines[4}|5]and[&|j9]can be done in 
0(d) time if B is monotone. 

Lemma 8. Consider a SEQBlN(iV, X, C, B) constraint such that B is monotone. For 

all j G D{Xi), i € [1, . . . , n], Cf(x itj ) = \Jv&D(x i+1 ) ( c f( x i+i,v) W c(j, v)) can be 
computed in 0(d) time. 

Proof. We partition all supports v into two groups based on the value of c(j, v). The first 
group So contains values such that c(j, v) — and the second group Si contains values 
such that c(j,v) = 1. We find c 1 = \J v es c f( x i+i,v) and c 2 = LUsi c f( x *+i,v)- 
Then we find c/(xij) — c 1 U (c 2 l+J 1). We prove the lemma for c 1 (c 2 is analogous.) 

Compute c . We assume that p is the smallest lower bound among the forward cost sets 
of the values in Sq and q + 2 is the greatest upper bound: p = min„ e g lbt(xi+i v ) and 
q + 2 = max.„ e s (l ubf (x i+ i v ). We refer to I ■ zip of Cf(xi + i_ v ) as I ■ zip(x i+l v ) to 
simplify notation (similarly, for the other two parts i ■ vol and r • zip). By Theorem[T] 
we know that lb(l ■ zip(xi+\ >v )) G [p,p + 2], ub(r ■ zip(Xi+i tV )) G [q, q + 2], lb(i ■ 
val(x.i + i, v )) £ [p,p + 6] and ub(i ■ val(xi+i tV )) G [q — 4, q + 2]. Hence, we compute 
the 20 indicator values J l y zlp (v), y G [p,p + 2], J r y zip (v), ye [q,q + 2], J l y vallb (v), 
y G [p 7 p + 6], and J y ' valub (v), ye [q — 4, q + 2], v € So- For example, we define 
J l y"*(v) = 1, iff lb(l ■ zip(x l+l . v )) - y and J^(S ) - niax weS „ J l y ztp (v), y G 
[p,p + 2]. Similarly, we compute the other 19 indicators. This can be done in 0(d) time 
with a linear scan over Cf(xi+\. v ), v G Sq. Then we can compute (JueSn c /( a -*+i,f) = 
[a* ~ b* — c* ~ d*} in 4 steps, each of which takes 0(1) time. 

Union ofi ■ val. Theorem[T]shows that all i ■ vol sets must overlap. Hence, the union 
of i ■ val(xi+ij) forms an interval. We find the minimum value y, y G {p, . . . ,p + 6} 
such that J y vallh (Sq) = 1. If such a value y exists then we set b* — y. Then we find the 



largest value y' E {q - 4, . . . , q + 2} such that J l ? alub (So) = 1 and set c* = y'. Note 
that if y exists then y' exists. If y does not exist we know that all Cf(xi + i tV ), v E So are 
zipper sets and we set b* = c* = 0. 

Union ofl-zip. Suppose b* ^ 0. We find indicators Jy Zlp (So), y E [p,p+2], that are 
set to one. Set p' to the minimum among [p, p+2), for which there exists J p , Zip (So) = 1. 

If Jp+l p (5 ) = 1 and J l p zip (S ) = 1 or J p +f(S ) = 1 or b* E {p,p + 2} then set 
a* and reset b* , so that a* = b* = min(p + l,p') otherwise set a* = p' and leave b* 
unchanged. Union of r ■ zip is similar to union of I ■ zip. 

Union of zippers. Suppose b* = 0. Then we determine which of 4 distinct sets 
(Appendix C.l, Lemma 15) are present among c^(xi + i iV ), v E So- As there are at most 
4 such that are zippers we can union them in 0(1) time and identify the values a* , b* , c* 
and d* . 

We can compute c 1 U (c 2 W 1) in O(l). We omit the proof here due to space consid- 
erations (see Appendix D, Lemma 19). 

Complexity. For each j g D(Xi), i g [1, . . . , n], the forward cost set Cf(xij), can be 
computed in 0(d). As we have 0(nd) such sets, the total time complexity is 0(nd 2 ). 
One way to reduce this complexity is to compute Cf(xij) in O(l). □ 

Corollary 2. Phase 2 of the algorithm SeqBinAlgNew runs in 0{nd 2 ) time. 

Phase 4 o/SeqBinAlgNew. We present the final phase of SeqBinAlgNew. 

Lemma 9. Consider a SEQBlN(iV, X, C, B) constraint such that B is monotone. For 
each i £ [1, . . . , n], the total time complexity to compute c(xij) D D(N) ^ 0, j G 
D{Xj), is 0{d). The total time complexity of Phase 4 is 0(nd). 

Proof. Preprocessing of D(N). We use a preprocessing step to compute cumulatively 
sums s° dd and s e v ven to collect information about the presence of odd and even values 
in D{N). Hence, s° dd = 0, s]f x = s° dd + (j e D(N) A j is odd ),j'6 [1, . . . , tt*°p]. 
Similarly, we compute s| t,era .This can be done in O(d). Then the value s° dd — s° dd 1 
shows how many odd values of D(N) are in the interval [j2, ji]- 

Performing the check. By Lemma|7]we know that c(xi.j) is either zipper or i zipper. 
If c(xij) is an even zipper set [a ~ 6] we check if s^ ven — s^1 e " ^ 0. If so the variable- 
value pair Xi = j is supported. Similarly, if c(xij) is an odd zipper set. Suppose c(xij) 
is an i-zipper set [a ~ b — c ~ d}. Then, we can check separately whether each of three 
parts [a ~ b] U [b — c] U [c ~ <f] has an intersection with D(N) using the cumulative 
sum values. Hence, the check can be done in O(l) time. There are O(d) sets c(xij), 
j E D(Xi). Hence, the total time complexity of one layer is 0(d). 

Complexity. The graph has 0(n) layers. So, the total time complexity is 0(nd). □ 

4.2 DC algorithm with monotone B and row and column convex C 

Finally, we show that if C is row and column convex then SeqBinAlgNew runs in 
0(nd) time. The only remaining bottleneck is Phase 2. 



Lemma 10. Consider a SeqBin(./V, X, C, B) constraint such that B is monotone and 
C is row and column convex under that same ordering tt that gives monotonicity. The 
sets Cf(xij) and Cb(x-ij) r j € D(X{), i € [1, . . . , n], can be computed in time O(d). 

Proof. We give an algorithm to compute Cf(Xi t j). Computing Cb(xij) is similar. Re- 
call that in PathDP (lines QQ, c f (x id ) = {J veD{Xi+1 ),(j,v)£B M^+m) W c(j,v)). 
Since B is monotone, the set of B supports of Xj = j, Support s(xij) = {v\(j, v) e 
BAnG D(Xi + i)}, forms the interval [a, v 1 }, v l < Tr top for some a such that v t is that 
maximum value in D(Xi+i). 

As C is row convex, the interval [a, v*] is partitioned into 3 subintervals [a, v*\ = 
[a, b] U [b, c] U [c, u'] such that c(j, ») = 1, o £ [a, 6] U [c, zj*] and c(j, d)=0,ii£ [6, c] 
and we can write Cf{x itj ) = c 1 U c 2 U c 3 where c 1 = U lJe[a 61aD / x } c/(x i+ i,„) W 



Jve[a,b]nD(X i+1 ) 
Jv£lb,c]nD(X, +1 ) v f(- L i+l,v)<uiu^ — (Jve[c,d]nD(X z + 1 ) 

exploit the fact that c 1 , c 2 , c 3 are computed over intervals to avoid recomputation of the 
indicator values for each c(xij), as was necessary in Lemma [8] We do this with an 
0(d) time preprocessing step that allows us to then compute each c(xi.j) in O(l). This 
reduces the complexity of lines [2|j5]from 0(n 2 d 2 ) to 0(nd). 

The preprocessing step consists in computing cumulative sums over the indicator 
values in an interval. For each indicator value J*, z £ {I ■ zip,r ■ zip,i ■ val}, we 
compute the array csj(z), which counts the number of values in for which the 
indicator value is 1. For example, cs l y zlp (0) = 0, cs l y zlp (v) — cs l y zlp (v — l) + J l y zlp (v), 
y G \p,p + 2], v € D(Xi + i). To compute the cumulative sums we do a linear scan over 
Cf(xi+i iV ), v € D(Xi + i). Given these sums we can compute whether, for example, 
lb(l ■ zip(xi+i. v )) = y, v <G [a', b'] in constant time by checking whether cs l y zlp (b') — 
cs l y zip {a' - 1) > 0. 

The rest of the proof is identical to Lemma [8] (subsection 'Compute c 1 '.). It takes 
0(1) time to compute c 1 given the cumulative sums. We do this for d sets Cf(xij) and 
the preprocessing step takes O(d), so the total time complexity is 0(d). □ 



Corollary 3. Lemma 10 holds if the negation of C is row and column convex under the 



same ordering it that gives monotonicity. 

Proof. The only difference from the proof of Lemma [To] is that the interval [a, u*] of 
supports of each value j € D(Xi) is partitioned into [a, «*] = [a, b] U [b, c] U [c, «*], 
such that c(j, v) = 0, v € [a, 6] U [c, «*] and c(j, ») = l,»e [6, c]. □ 

Example 8. Suppose (xi+i tV ), v g [1,2,3] contain the following forward costs: 
c/(afi+i,i) = [1 - 5 - 8 - 12], c/(x i+1 , 2 ) = [3 ~ 5 - 6 - 10] and c f (x i+1 ^) = 
[2 ~ 6 — 8 ~ 10]. The min value p is 1 and the max value q + 2 is 12. First we compute 
cumulative sums. The table below shows the non-zero vectors of cumulative sums. 
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Cumulative sums 


12 3 


Cumulative sums 
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Cumulative sums 
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Cumulative sums 


12 3 


csi' a * p (u) 
cs 3 " p (v) 


[0 111] 
[0 1] 
[0 11] 


c4 " p (d) 
cs r l2 ""(v) 


[0 12] 
[0 111] 




[0 12 2] 
[0 1] 




[0 11] 
[0 112] 



Suppose that the values 1, 2 and 3 are supports for Cf(xi t %) — [a* ~ b* — c* ~ d* 



Using LemmalO we find U j=[h3] i-val(x i+hj ) = [5-8]U[5-6]U[6-8] = [5-8]. So 
b* = 5 and c* = 8 Then, we check if there exists lb(l ■ zip(a; i+1 j)) = y, y € {1, 2, 3} 
using cumulative sums. For y € {1,2,3} we get that cSy Zip (3) — cs^ p (0) > 0. 
So we set a* and reset b* so that a* = b* = min(2, 1) = 1. Finally, we check if 
there exists ub(r ■ zip(xi+ij)) — y, y € {10, 11, 12}. For y E {10, 12} we get that 
cSy 2lp (3) — cSy P (0) > 0. Moreover, the value q + 1 = 11 does not occur among 
ub{r ■ zip(xi + ij)). Hence, we set d* = 12. This gives Cf(xn) = [1 — 8 ~ 12] □ 

Corollary 4. The filtering algorithm SeqBinAlgNew enforces domain consistency 
on Change and Smooth in 0(nd) time. 

Proof. Change is SeqBin(7V, X, C e {=, 7^, <,<,>,>}, true). This satisfies 



Lemma 10 as {=,<,<,>,>} are row/column convex as is the negation of {y^}. 
Smooth is SeqBin(./V, X, C is {\Xi-X i+1 \ > est}, true), est £ N, is the negation 
of row/column convex constraint {\X{ — Xi + i | < est}. □ 

Corollary 5. The filtering algorithm SeqBinAlgNew enforces domain consistency 
on INCREASINGNVALUE in 0{nd) time. 

Proof. INCREASINGNVALUE(X, N) is SeqBin(X, N, <, =) HI- This version of 
the SeqBin constraint is counting-continuous and therefore c(xij) is an interval. 
Hence, all costs c/,Ch are intervals. Moreover, = is row and column convex, so 
SeqBinAlgNew reduces to SeqBinAlg and enforces GAC in 0(nd). □ 

Finally, we note that we can slightly generalize SeqBin so that it does not require 
the same B and C for every pair of variables as the proof of Theorem [T] does not rely 
on the property that B and C are the same for each pair of consecutive variables. 



5 Conclusions 

The SeqBin meta-constraint subsumes a number of important global constraints like 
Change, Smooth and IncreasingNValue. We have shown that the filtering al- 
gorithm for SeqBin proposed in Qj has two drawbacks even under strong restric- 
tions: it does not detect bounds disentailment and it is not idempotent. We identified 
the cause for these problems, and proposed a new propagator that overcomes both 
issues. Our algorithm is based on a connection to the problem of finding a path of 
a given cost in a restricted rt-partite graph. Our propagator enforces domain consis- 
tency in 0(nd 2 ) and, for special cases of SeqBin that include CHANGE, SMOOTH, 
and IncreasingNValue, in 0(nd) time. 
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A Revisiting SeqBin. 



Lemma 11. The filtering algorithm SEQBlNALG/or SeqBin(7V, X, C, B) with non- 
monotone B is not idempotent. There is a family of problems where SeqBinAlg takes 
0(n) iterations to reach the fixpoint. 

Proof. Figure[5ja) shows the graph representation of SEQBlN(iV, [Xi , X 9 ] ,C,B), 
D(N) = [2,4,6,8]. 

Iteration 1. The minimum cost assignment is [0, 1, 0, 1, 0, 1, 0, 1, 0, 4] with cost 1. 
Moreover, this assignment is the only support for X% = 0. As 1 ^ D(N), X$ = is 
pruned. None of the other value is removed. The first iteration is finished. Figure |5jb) 
shows new domains. 

Iteration 2. The minimum cost assignment is [0, 1, 0, 1, 0, 1, 0, 1, 2, 4] with cost 3. 
Moreover, this assignment is the only support for Xq = and Xj = 1. As 3 ^ D(N), 
these values are pruned. Figure[5jc) shows new domains. 

Iteration 3. The minimum cost assignment is [0, 1, 0, 1, 0, 1, 2, 2, 2, 4] with cost 5. 
Moreover, this assignment is the only support for X 4 = and X 5 = 1. As 5 ^ D(N), 
these values are pruned. 

The reason for this behavior is that possible costs are [1,3,5,7,8], but only as- 
signments with cost 8 are solutions. As SeqBinAlg only computes lower and upper 
bounds on the cost it prunes values that are supported by an assignment with cost 1 first, 
then it prunes values that are supported by an assignment with cost 3 and so on. 

Note that this example embeds a pattern of two variables, e.g. X 4 , X 5 and X 6 , X 7 . 
We can extend the example with an unbounded number of patterns, p = 0(n), on two 
variables and adjust D(N) appropriately. Hence, SeqBinAlg takes p/2 iterations to 
reach the fixpoint. □ 



B Other properties of closeness of costs 

The following lemma shows that there exists only one way to obtain forward cost sets 
for two variable-value pairs Xi — j and Xi — k such that the distance between their 
upper bounds is two. 

Lemma 12. Consider a SeqBin(7V, X, C, B) with monotone B and arbitrary C. Con- 
sider two values j, k £ D(Xi), j -< k. Suppose ubf(xij) = p and ubf(Xi j i e ) = p + 2. 
The following holds: 

1. Vu G D(Xi + i), (j, u) 6 B we have that ubf(xi+i fU ) = p 

2. 3v,w € D{Xi + i), v -< w, (j,v) B and (j,w) € B such that ubf(Xi+i tV ) = 
P + 1, ubf(xi + i_ w ) — P, c(k, v) = 1 and c(j, w) = 0. 

Proof. Figure [6] (a) illustrates the lemma. 

Let v, w be values in D(Xi + i) such that we obtain ubf(xi t k) from ubf(%i+i lV and 
ubf(xij) from ubf(xi+\ >w . 

First, we observe that ubf(xi+x >u ) + c(j, u) < p, otherwise ubf(xij) > p. As the 
cost value is or 1 this implies ubf(xi+% tU ) < p. 
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Fig. 5. A 11-partite graph that corresponds to the SeqBin constraint from the proof of Lemma|l 1| 
Dashed edges have cost one and solid edges have cost zero, (a) shows initial costs; (b) shows costs 
after X-j, — is pruned. 



Second, if (j,v) 6 B then, as (j,w) E B and costs are or 1, | ubf(xi.k) — 
ubf{xi_j) |< 1, a contradition. Hence, (j,v) ^ B. 

Third, we can only obtain ubf(xij) — p and ubf(xi : k) = p + 2 if v -< w and 
ubf(xi + i :W ) < ubf(xi+i :V ). Otherwise, if w -< v, then from (j,w) £ B and the 
monotonicity of B, we get (j, v) <G B, which we showed above is contradictory. If 
v -< w and ubf(xi+i w ) > ubf(xi+i yV ) then, because v -< w and B is monotone, the 
value ubf(xi + i_ w ) or ubf(xi + \ tW ) + 1 is in both sets Cf(xij) and Cf(xi j k)- Hence, 
I ubf(xi : k) — u bf(xi,j) |< 1, a contradiction. 

Therefore, ubf(xij) = p and ubj(xi.k) = p + 2 must be obtained from two sup- 
port values v,w such that v -< w and ubf(xi+i, w ) < ubf(xi+i, v ) and (j,v) £ B. 
By Lemma [5] if v -< w and ubf{x i+ i tV ) > ubf{xi + x tW ) then ubf{xi + \. v ) — 
ubf(xi+i ;W ) + 1. As ubf(xij) = p and (j, v) £ B then there are two cases. 

Suppose ubj(x i+ i tW ) = p and c(j,w) = 0. As ubf(xi_k) = P + 2 then 
= p + 1. 

Suppose = p — 1 and c(j,w) = 1. As ubf(xi > k) = p + 2 then 

u6/(xj+i iU ) must be equal top+1. and c(j, w) — l.As ubf(xi+i,v) — ubf(xi+x tW ) = 2 
then w -< w. This contradicts the fact that i? -< w. 

Hence the only possible upper bound costs for support values are ubf(xi + i, w ) = p 
and ubf{x t+ i, v ) = p + 1. 
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w, ubf (x, +1 , w ) = p 
- ^v, ubf (x t+ i,» ) = p+1 
u, ubf (xt+i, u ) = p - 1 
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w, ubf (x t+ i,w) = P+1 
v, ubf (x w ,» ) = p 

u, ubf (x t+ i,u ) = P + 2 
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Fig. 6. Computation of the upper bound on the forward cost for j,k G D(Xi), j -< k (a) 
ubf(xij) — p and w&/(ajj,fc) = p + 2; (b) demonstrates that $u G D(Xi+i), (j, u) G B 
such that ubf(x i+ i, u ) — p — 1 in case (a); (c) lbf(xij) = p + 2 and lbf(x it k) = p; (d) 
demonstrates that pit g D(X i+ i), (j, it) G -B such that lbf(x i+ i, u ) — p + 2 in case (c); 



Finally, we prove that tt&/(;Ei+i = p for all supports u of j in Xj+i. First, by 
lemmaB] and ubf{x i+ i iV ) > p+ 1, we get > p — 1. Suppose there exists 

u G (j, it) G B such that ubf (xj+i „) = p — 1. We know that there exists 

a support value v, ubf(xi+i jV ) —p+1. Hence, u -< i> by Lemma[5] As (j, u) G -B 
then, by monotonicity (j, u) G B which contradicts the fact that (j,v) ^ B. Figure [6] 
(b) illustrates this situation. □ 

Lemma 13. Consider a SEQBlN(iV, X, C, B) with monotone B and arbitrary C. Con- 
sider two values j, k G D(Xi), j -< k such that lbf(xij) = p + 2 and lbf(x{ t k) = V- 
The following holds: 

- Vu G D(Xi + i), (j, u) G B we have ubf{xi+i. u ) = p + 1. 

- 3v,W G D{Xi + i), v -< w, (j,v) ^ B and (j,w) G B such that lbf{xi + i_ v ) = p, 
lbf(xi + \ tW ) = p + 1 and c(fc, v) = awd c(j, w) = 1. 



Proof. The proof is analogous to Lemma 12 (Figure|6] (c) and (d)). □ 



C The inductive part of Theorem [Tfs proof 

Definition 5. An i-zipper [a ~ 6 — c ~ d] /s non degenerated if either a < b or c < d. 

Note that an interval is a degenerated i- zipper set. 
Example 9. The set 0/(0:1,3) = [1 ~ 5 — 8] is non degenerated. 



C.l Zipper block 



We consider the first block of variables. We prove Theorem [T] for this block together 
with additional restrictions of a structure of cost. 

Lemma 14. Consider a SeqBin(7V, X, C, B) constraint with monotone B and arbi- 
trary C. Suppose Cf(xtj), j € D{X t ), t G [ri2, n] does not contain an interval. Then 
for any Cf(x t j), t G [n 2 , n], j € D(X t ) the following holds: 

- Uniqueness and Overlapping. Cf(x t j) is a zipper set. 

- Cost Structure (upper bounds). If there exist two values j, k G D(X t ) such that 
ubf(x t .k) — ubf(xtj) = 2 and ubf{xt^k) is even(odd) then 

1. j<k 

2. if there exists g such that ubf(xt, g ) is odd (even) then ubf(xt.j) < ubf(xt, g ) < 
ubf(x t ,k). 

3. Ibf(xt.j) = lbf(x tik ) 

4. there do not exist two values g,h € D(X t ) such that lbf{x t f l ) — lbf(x t , g ) = 2 
and lbf(xt,h) even(odd). 

- Cost Structure (lower bounds). If there exist two values j, k 6 D(X t ) such that 
lbf(xtj) — lbf(xt,k) = 2 and lbf(xt t k) is even(odd). 

1. 3<k 

2. if there exists g such that lbf(xt ig ) is odd (even) then lbj(xtj) < lbf(xt. g ) < 
lbf(x t ,k)- 

3. ubf(xtj) = ubf(x t ,k) 

4. there do no exist two values g, h £ D(X t ) such that ubf(xt.h) ~ubf(xt, g ) = 2 
is even(odd). 

Proof. By induction on the distance from n. The base case is trivial, as ubf(x n ,i) = 
lbf(x n: i) — 1 for all i. Suppose this holds for all X t+1 , . . . , X n . We show that it holds 

for-Xt' 

Uniqueness and Overlapping. We prove by contradiction. Suppose that Cf(x t .j) — 
[p, . . . r, r + 3, q] so that there exists a hole of size 2. By induction hypothesis, 
Cf(xt+ij), j € D(Xt+i) are zipper sets. Hence, r and r + 3 must come from dis- 
tinct support values v and w, respectively. As r G Cf(xt.j) and r + 2 ^ Cf(x t .j), 
ubf(x t+1<v ) + c(j,v) <r. Hence, lb f (x t+1 . v ) < ub f {x t+lyV ) <r. 

As r + 3 G Cf(x t j) and r + 1 £ c/(x t j), lbf(x t+ i iW ) + c(j,w) > r + 3. 
Hence, ubf(x t +i, w ) > lbf(xt+i,w) > r + 2. Lemma[5] v -< w and ubf(x t +i, w ) = 
lbf(xt+i, w ) = r + 2. Putting it all together, lbf(x t +i, v ) < r,v -< w and lbf(xt+i, w ) = 
r + 2, we violate Lemma|6] 

We only prove the properties of cost structure for the upper bounds, as the proofs 
for the lower bounds are symmetric. 

Cost Structure(upper bounds). Property^ If ubf(x t k) — ubf(x t j) = 2 then, by 
Lemma|5] j -< k. 

Cost Structure( upper bounds). Property^ W.l.o.g. we assume that ubf(xt.k) is 
even. Consider g such that ubf(xt. g ) is odd. 

Note that ubf(xt, g ) can not be smaller than ubt[xt,j) as ubf(xt,k) — u bf{xt, g ) > 2 
in this case and LemmaBlis violated. On the other hand, ub f [xt, g ) can not be larger than 



ubf(xt^k), as ubf{xt, g ) — ubf(x t j) > 2 and Lemma|5]is violated. Hence, ubf{x tl k) < 
ubf(x t<g ) < ubf(x t j). 

Cost Structure( upper bounds). Property^ We denote lbf(x t ,k) — P- W.l.o.g. we 
assume that p is even. Hence, ubf(x t ,k) is even. 

We prove by contradiction. If lbf(x t .j) ^ lbf(x t ,k) then, by Lemma [6] and by 
the induction hypothesis Cf(xt } j) and Cf(xt,k) are zipper sets, hence, we get that 
Ibf(xtj) = lbf(x t ,k) + 2. 

By Lemma [T3| we know that for all supports u 6 D(X t +i), (j, u) € B we have that 
lbf(x t +i, u ) = p+1. Hence, lbf(x t+ i :U ) is odd. As Cf(x t +i :U ) is a zipper set by the 
induction hypothesis, all supports of the forward cost of j are odd zipper sets. 

Consider the upper bounds on the forward cost. By assumption, ubf(xt j k) — 
ubf(x t j) = 2. We denote ubf(x t j) = p'. Note thatp' is even as ubf(x ty k) is even. By 



Lemma 12 we know that j is only supported by values u £ D{X t +\), (j, u) £ B for 
which ubf(x t +i yU ) — p' ■ As Cf(x t +i. u ) is a zipper set by the induction hypothesis, all 
supports of the forward cost of j are even zipper sets, a contradiction. 

Cost Structure(upper bounds). Property^ We prove by contradiction for even 
ubf(xt,k) and ubf(x t j). The odd case is symmetric. We denote lbf(xt.h) — p and 
ubf(x ttk ) = q. 

Suppose there exist values g, h € D(X t ) such that lbr(x tg ) — lbf(xt y f l ) = 2 and 
lbf(xt,h) is even. Hence, p and q are even. By LemmasBmlwe also know that j -< k 
and g -< h. 

As ubf(xtj) 7^ ubj(xt,k) then lbf{xt l k) = ^f{ x t,j) by Property [3] for the up- 
per bounds. Hence, there are two options: Ibf(xtj) — Ibf(xt.k) = lbf(xt.h) = P or 
Ibf(xtj) — lbf(x t ,k) = lbf(xt, g ) = p + 2. In the first case, we have lbf(xt, g ) — 
lbf(xtf.) = 2. In the second case, we have lbf(x t .k) — ^/(^t,fe) = 2. Hence in both 
case, by Property [3] for the lower bounds, ubf(x t j) = ubf{x tg ) = ubf(x t ,h)- This 
leads to a contradiction. □ 

We will show that there exist a bounded number of distinct sets Cf(xi.j), j E 



D(Xi), i — 1, ... ,n. Lemma 14 shows that all of them are zipper sets, however, these 
zipper sets might have different lower and upper bound that satisfy Lemma [14] 

Lemma 15 enumerates of all distinct sets Cf(xij), j <= D(Xi). We know that each 
of Cf(Xij) is a zipper set ( and/or an i-zipper set in the next layers). However, these 
zippers are subsets of the interval [p, q], p = imn(lbf(xij)) and q = ma,x(lbf(xij)), 
j € D(Xi). Hence to describe all possible Cf(xij), in addition to the two possible 
structures, we need to take into account two differences: lbf(cf(xij)) — p and q — 
ubf(cf(xi.j)). As we use enumeration of all possible Cf(xij) for other layers we define 
the notion of a framed- structure as one of the two possible structures, a zipper or an 
i-zipper, together with two differences lbf(cf(xi j)) — p and q — ubf(ct(xij)). For 
example, \p — 1 ~ q — 1] and [p + 2 ~ q + 2] have the same structure, a zipper set, but 
there are two distinct framed- structure as they have different lower and upper bounds. 



Lemma 15. Consider a SeqBin(7V, X, C, B) constraint with monotone B and ar- 
bitrary C. Suppose Cf(xt.j), j G D(X t ), t € [t',n] do not contain intervals. Let 
p = mirij eD(x t ){lbf(x t j)} andq+l = maXj eD ^x t ){ubf(x t! j)}. 



1. Suppose there exist two values j, k G D(X t ) such that ubf(xt^k) — ubf(xt.j) = 2 
and ubf(x t .k) is even(odd) and there exist two values g,h G D{X t ) such that 
lbf(xt t g) — lbf(xt l h) = 2 and lbf(x tt g) is odd(even). Then {j,g} -< {k, h}. More- 
over, lbf(x t ,h) < lbf(x t j) = lbf{x t ,k) < lbf(x t ,g) andubf(x t j) < ubf(x t , g ) = 
ub f (x t ,h) < ubf(x ttk ). 

2. Cf(xt,j) is one of four framed- structure which are zipper sets together with their 
lower and upper bounds in Figure^a). We label 4 possible framed-structures of 
Cf(xt,j) as ST1, ST2, ST3 and ST4. 



Proof. Property^ We only need to show that g -< fc and j ^ ft as the rest follows from 
Lemmas [5}j6] 

We denote lbf(x t ,h) = P and ubf(xt 1 h) — q- Hence, lbf(xt, g ) = p + 2 and 
ubf(xt <g ) = q. As ubf(xt t k) and ubf(x t j) are even and ubf(x t j) — ubf(x t ,k) = 2 
then, by Lemma 14 Property [2] for the upper bounds we know that each odd zipper 
set must start at p + 1. Hence, lbf(x t j) — lbf(xt,k) = p + 1. Therefore, taking into 
account that ubf(xt.h) — Q> the only choices for the upper bounds of k and j are 
ubf(xt,k) = q + 1 and ubf(xtj) = q — 1. Otherwise, we would violate Lemma|5] 
Hence lbf(x t ,h) = p < lbf(x t ,j) = lbf(x tj k) = P + 1 < lbf(x tg ) = p + 2 and 
ubf(xtj) = q - 1 < ubf(x t>g ) = ubf(x t . h ) = q < ubf(x t ,k) = q + I- Figure ^a) 
shows the structure of Cf(xtj), Cf(xt 1 k),Cf{xt,h) an d Cf(x t . g ). 

By an exhaustive search over all possible support values v,w G D(X t +i) that 
satisfy Lemmas [6}j5] we find that the only possible structure of support values for each 
pair j -< fcandg ~< h is the following: Cf(x t +i, v ) = [p ~ q] and Cf(xt+i, w ) = [p+1 ~ 
q — 1], Figure C.l illustrate the support values v, w. 
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If k -< g and (u, k) E B then (ft, v) £ B and the c/(xt+i jt ,) = [p ~ g] or [p + 1 ~ 
g + 1] is included in c/(a: tjS ). Hence, ubf(x t , g ) > g. This leads to a contradiction. The 
case ft -< j is symmetric. 

Property^ The first property is immediate from Lemma 14 and Property [Tj Fig- 
ure|7ja) illustrates all possible framed-structures. □ 



Example 10. Consider the variable X4 in Example|4] The set 0/(0:4,3) = [1 ~ 5] is the 
framed-structure ST2 and 0/(2:4.2) = [3 ~ 5] is the framed-structure ST A. □ 
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Fig. 7. Intervals are highlighted in blue, zipper sets are in yellow and an i zipper sets are in 
orange. All possible framed-structures of (a) Cf(x ni +2,j), j G D(X ni +i,); (b) Cf(x ni +2, j), 
j G D(X ni+2 ) and (c) c f (x ni+1 ,j), j g D(X„ 1+l ) when STl, ST2, ST3, ST4 are present 
among c f (x n2 ,j); 



C.2 Zipper + i-Zipper block. 

We consider the second group of variables. Lemma|4]shows that this group contains at 
most two variables. We assume here that it contains exactly two variables, as this is the 
most general case. Next we refine the structure of Cf(x.i,j), i E [ni + l,m + 2] and 
prove Theorem [T] 



®&& Xn,+2 CD®® 



Fig. 8. Intervals are highlighted in blue, zipper sets are in yellow and an i-zipper sets 
are in orange. All possible framed-structures of c/(x ni +2,j), j G D(X ni +2) when only 
STl, 5T3, ST4 are present among Cf(x ni+ 3,j); 



Lemma 16. Consider a SEQBlN(iV, X, C, B) constraint with monotone B and arbi- 
trary C. Suppose Cf(x s j), j G D{X S ), s G [ni + l,m + 2] are zipper sets. Suppose 
there exist j,j' G D{X t ), t = ri\ + 2, and k, k' G D(X t -i) such that Cf(x t j) and 



Cf(xt-i,k) ore zipper sets and Cf(xt.j') and Cf(xt-i.k') are i-zipper sets. Then the 
following holds for X t _i and X t : 

- Uniqueness. Cf(xtj) is either a zipper set or an i-zipper set [a ~ b — c ~ d] where 
either a — b and/or c = d. 

- Overlapping. If there exist two values j, k £ D(X t ) such that Cf(xtj) and Cf(xt^) 
are two non-degenerated i-zipper sets [a ~ b — c] and [b' — c ~ d](or[d^ b' — c]), 
respectively, then [b, c] n [b' , d] ^ 0. 

- Cost Structure. 

1. If there exist two values j, k £ D(X t ) such that Cf(xtj) and Cf(xt,k) ore two 
non- degenerated i-zipper sets [a ~ b — c] and [b 1 — c! ~ d] then a — b' and 
c = d. 

2. If there exists a value j £ D(X t ) such that Cf(x t .j) are non-degenerated 
i-zipper sets [a ~ b — c] ([&' — d ~ d]) then b = a + 2 (d = c' + 2). 

Proof. First, we prove the lemma for X t . Lemma [15] Property [2] defines all possible 
structures of Cf(x t +i, v ), v £ D(X t+ i). 

We perform complete enumeration of all possible structures of Cf(x t .j), j £ D(X t ) 
taking into account that {j, g} -< {k, h}. Figure[7]b) shows the result of the enumera- 
tion if ST1 — STA set framed-stmctures are present among Cf(xt+i.j), j £ D(X t +i). 
We label all framed-structures as ET1 — ET9. We also show the type of each framed- 
structure. To keep presentation clear we omit actual values for the lower bound and 
upper bounds and use [b, c] to denote an interval, [a ~ b] to denote a zipper and 
[a ~ b — c] or [b — c ~ d] to denote an i-zipper. Note that if one of the framed-structures 
ST1 — STA is missing then we get a subset of framed-structures of ET1 — ET9. For 
example, Figure [8] shows the result of the enumeration if ST1, ST3, STA are present 
among Cf(x t +ij), j £ D(X t +i). Hence, we can assume that all 4 framed-structures 
are presented among Cf(x t +ij), j £ D(X t +i). 

From the framed-structures ET1 — ET9 we see that in addition to zipper sets we 
only introduce degenerated i-zipper, which are intervals, {ETA, ET5, ET6, ET8) and 
i-zipper sets with non degenerated left side [a ~ b— c], b = a+2, ET7, and i-zipper sets 
with non degenerated right side \b' ~ d — d], d = d + 2, ET9. Moreover, the lower and 
upper bounds ET7 and ET9 coincide (this comes from {j, g} -< {k, h}). The statement 
of the lemma follows from ET1 - ET9. Note that ET3 must precede all ETi that 
contain p by Lemma|6] We denote ET U (ET W 1) = {ETi, ETi W = 1, . . . , 9}. 

Second, we prove the lemma for X t -\. We use a complete enumeration of cases 
again. This enumeration is feasible due to the restricted framed-structure of ET U 
[ET l+ll). Figure [^c) shows the result of the enumeration (note that we do not show 
restrictions on the partial order between structures.) In fact, it is sufficient to con- 
sider all sets that can be formed as the union of two sets in ET U (ET l+J 1). Let 
S = l J Sl .s 2 eETu(ETwi){ s i U $2} be the set of all sets that can be obtained by the 
union of two sets in S. Taking the union of any three sets s±, S2, S3 £ ET U (ET l±l 1) 
is in S again. So we reach a fixpoint after considering all sets of size two. The resulting 
structures show that the lemma holds. We do not specify any restriction on the relative 
order of the framed-structures. □ 



C.3 i-Zipper block. 



Lemma 17. Consider a SeqBin(./V, X, C, B) constraint with monotone B and arbi- 
trary C. If c(x ni +i, j) is an i-zipper set [a ~ b — c] then b — a < 4. Similarly, If 
c{x ni+ i, j) is an i-zipper set [b — c ~ d] then d — c < 4. 

Proof. Letp = min lbf(x ni +i,j). There are two ways to obtain i-zipper set [a ~ b—c]. 
Firstly, we can extent an existing i-zipper set. By Lemma [16] we know that for any 
i-zipper set [a ~ b — c] we have b = a + 2. As a 6 + 1} for any such i-zipper set 
we can extend a zipper with at most one hole on the left. For example, we can combine 
the i-zipper set NT4 shifted by one and the zipper NTH, (NT4 l+l 1) U NTH to 
obtain the i-zipper set [p ~ (p + 4) — (q + 2)]. Example [^demonstrates this situation: 
c/(afi 3) = [1 ~ 5 — 8]. Hence, any extension of an i-zipper set satisfies the lemma. 

Secondly, we can create a new i-zipper set. However, as all lower bounds are on 
distance at most 2 from each other we cannot obtain a new i-zipper set [a ~ b— c] with 
b — a > 2. Hence, the lemma holds. □ 

Lemma 18. Consider a SeqBin(./V, X, C, B) constraint with monotone B and arbi- 
trary C. The following hold for all i G [1, n\ + 1] 

- Interval monotonicity. If-K tov is the top value ofD(Xi), then the set Cf(xi^t op ) is 
an i-zipper set [a ~ b — c ~ d] with c > b. 

- Overlapping intervals. If there exist two values j,k G D(Xi) such that Cf(xij) 
and Cf(Xi k) are two i-zipper sets [a ~ b — c ~ d] and [a' ~ b' — cf ~ d'], 
respectively, then [b, c] fl [b' , cf] ^ 0. 

- Unique interval. Cf(xij) is an i-zipper set. 

- Cost structure. For any i-zipper set Cf(xi j) = [a ~ b — c ~ d] ;f fto/ifa 6 — a < 4 
anc/ cZ — c < 4. 

Proof. Interval monotonicity. By Lemma |4] we know that the top value 7r top ,7r* op G 
D(X ni+ i) contains an interval. Hence, Cf(x ni+l 7T to P ) is an i-zipper set. By the mono- 
tonicity of B, the top value of all Xi, i G [1, m] must also have an interval. 

Overlapping intervals. We show by induction on distance from n\ + 1. The base 



case was proven in Lemma 16 Now suppose this holds for all Xt+i, ■ ■ ■ , X nx +i. We 
show it holds for X t . Let 7r' op be the top value in D(X t +i). By the inductive hypothesis 
it contains the interval [a, b], b > a. By monotonicity of B, ir top supports all values of 
X t . Therefore, all Cf(x t j) will contain [a, b] or [a + 1, b + 1] for all j G D(X t ), so all 
pairs of values of X t contain intervals that overlap. 

Unique intervals. We again show this by induction on the distance from n\ + 1. The 



base case was proven in Lemma 16 Now suppose this holds for all X t+ i, . . . , X ni+ i. 
We show it holds for X t . 

Suppose there exists a value j G D(X t ) such that Cf(xtj) contains two distinct 
intervals. Consider first any two supports v and w of j in X t+ i that contain intervals. 
By the inductive hypothesis, Cf(xt+i, v ) contains a unique interval [a, b] and Cf(x t +i. w ) 
contains a unique interval [c, d] such that [a, b] and [c, d] overlap. Since v and w support 
j, Cf(xt.j) contains [a, b] + c(j, v) which is [a, b] or [a + 1, b + 1] and [c, d] + c(j, w) 
which is [c, d] or [c + 1, d+ 1], Shifting both overlapping intervals by the same amount, 



i.e., c(j, v) = c(j, w), yields overlapping intervals again, so their union is an interval. If 
we shift only one, i.e., c(j, v) ^ c(j, w), the intervals will in the worst case be adjacent, 
therefore their union is an interval. Because the union of any two supporting intervals 
yields an interval, the union of all supporting intervals also yields an interval. Therefore, 
if Cf(xtj) contains two distinct intervals, one of them must come from the union of the 
parts of the supports that form zipper sets. 

Assume that Cf(x t j) is of the form [a ^ b — c ~ d — e ~ /], 6 < c, c + 1 < d < e 
and, w.l.o.g., assume that the interval [b, c] is the union of zipper sets and the interval 
[d, e] is the union of intervals. 

Since the interval [b, c] is the union of two or more zipper subsets of i-zipper sets, 
one of the zipper subsets, say Cf(x t +i. v ), must terminate at c — 1, i.e., ubf(x t +i. v ) — 
c — 1, otherwise Cf(xtj) would also contain c + 1 and [b, c] would not be a maximal 
interval, is exactly a zipper set, otherwise it would contain 

Moreover, one of the supports of j that support the interval [d, e], say w, must be 
such that Cf(xt+i : w) contains the value e — 1, so ubf(xt+i, w ) > e — 1 > d > c + 1, so 
ubf(x t +i jW ) > c+1 = (c-l)+2 = ubf(x t +i, v )+2, or ubf(x t +i, w ) > ubf(x t +i, v ) + 
3 which contradicts Lemma [5] 

Cost structure. By Lemma [4] and the unique intervals property above we know that 



any c(x ni ,j) is an i-zipper set and it must contain an interval. By Lemma 17 we know 
that for any i-zipper c(x ni , j) = [a ~ b — c ~ d], j G D(X ni ), b — a < 4 and d—c < 4. 
Hence, lemma holds for the riith layer. The important observation is that starting from 
layers n\ — 1 none of c(xi,j) = [a ~ b — c ~ d], j G D(X t ), t £ [1, . . . , n% — 1], has 
a support value at the (t + l)th layer whose cost set is a zipper set by Lemma|4] 

Suppose the statement holds for t — 1 layers starting with the n f 1 h layer. To obtain an 
i-zipper set Cf(x t ,j) we must combine two support values v, w € D(X t+ i) such that 
c/(xt+i, u) — [a u ~ b u — c u ~ d u ], u G {v, w}, b u — a u < 4 and d u — c u < 4 as there 
are only available framed-structures of c(xt+i, u), u 6 D(X t +i) that contain holes. As 
[b v ,c v ] n [&«,, Cu,] 7^ 0, |ov - a w \ < 2 and |d„ - d w | < 2, b v - a v < 4, d v - c v < 4 
^t« — a™ < 4 and d w ~c w < 4, the union of any subset of Cf(xt+i, j)U(cf(xt+i, + 
can not create an i-zipper set [a' ~ 6' — c' ~ d'] such that fe— a>4ord — c>4. 
Hence, the lemma holds for the t th layer. □ 



Corollary 6. The number of distinct sets ofcf(xij), j e D(Xi), is bounded at the ith 
layer, i = 1, . . . , n. 



Proof. Consider the set Cf(xij), j £ D(Xi). By Lemma 15 each variable in the zipper 
block of variables contains at most 4 different framed-structures of costs. By LemmafTo] 
each variable in the zipper + i-zipper block of variables contains at most 18 different 



framed-structures of costs. By Lemma 18 each variable in the i -zipper block of variables 
contains at most 9 different i-zipper sets. Each of them can start/finish in at most 3 
points. This gives at most 81 possible distinct framed-structures. Hence, the number of 
distinct sets of Cf(xij), j € D(Xi) is bounded. □ 



D Domain consistency algorithm (omitted proofs) 



Lemma 19. Consider a SeqBin(./V, X, C, B) constraint such that B is monotone. For 
for allj G D(Xi), ie [l,...,n] Cf(x ij j) = \JveD(x z+1 ) i c fi x i+i,v) w c 0») can be 
computed in 0(d) time. 

Proof. We partition all supports v into two groups based on the value of c(j, v). The first 
group So contains values such that c(j, v) — and the second group Si contains values 
such that c(J,v) = 1. We find c 1 = \Jves c f( x i+i,v) and c 2 = U„ G Si c f( x i+i,v)- 
Then we find Cf(xij) — c 1 U (c 2 l±J 1). We prove lemma for c 1 (c 2 is analogous.) 

Compute c . We assume that p is the smallest lower bound among the forward cost sets 
of the values in Sq and q + 2 is the greatest upper bound: p = mm ve s lbt(xi+i v ) and 
q + 2 = max„ £ s (l u6/(a;i+i )t) ), We refer to ^ • zip of Cf(xi+i )V ) as Z ■ zip(xi+i v ) to 
simplify notations (similarly, for the other two parts i ■ val and r ■ zip). By Theorem[T] 
we know that lb(l ■ zip(xi+x,v)) G [p,P + 2], w6(r • zip(x i+ i !tI )) e [g, g + 2], Z6(i • 
wa^Xi+i.t,)) G [p,p + 6] and ■ wa/(xi+i it ,)) e [g — 4, g + 2], Hence, we compute 
the 20 indicator values J l y zvp {v), J r y ztp {v), 'j l y val »>(v) and J l y val ^(v), v G 5 . We 
define ^^(w) = 1, iff J6(I • 2»p(x i+ i l0 )) = y and J l y zi r(S ) = max„ eSo 4' zi p( V ). 
Similarly, we compute the other 19 indicators. This can be done in 0(d) time with a 
linear scan over cj(x i+ i tV ), v G Sq. 

Then we can compute U u <es c f( x i+i,v) = [a* ^ b* — c* ^ d*] in 4 steps, each of 
which takes 0(1) time. 

Union ofi ■ val. Theorem[T]shows that all i ■ val sets must overlap. Hence, the union 
of i ■ val(xi+ij) forms an interval. We find the minimum value y, y £ {p, . . . ,p + 6} 
such that J£ val "> (Sq) = 1. If such a value y exists then we set b* — y. Then we find the 
largest value y' G {q - 4, . . . , q + 2} such that J^ valub (S ) = 1 and set c* — y'. Note 
that if y exists then y' exists. If y does not exist we know that all Cf(xi+i :V ), v G So are 
zipper sets and we set b* = c* =0. 

Union ofl-zip. Suppose b* ^ 0. We find indicators J y zlp (So), y € [p,p+2], that are 

set to one. Set p' to the minimum among [p, p+2], for which there exists J l p , zlp {So) = 1. 

If J l p +f(So) = 1 and J l p zip {S ) = 1 or J l p +f(S ) = 1 or 6* G {p,p + 2} then set a* 
and reset 6* a* = &* = min(p + l,p') otherwise set a* = p' and leave b* unchanged. 
Union of r • zip is similar to union of I ■ zip. 

Union of zippers. Suppose b* = 0. Then we determine which of 4 distinct sets 
(Lemma 15 i are presented among Cj(xi+i lV ), v G Sq. As there are at most 4 such that 



are zippers we can union them in 0(1) time and identify values of a* ,b* , c* and d*. 

Compute c 1 U (c 2 l±l 1) . 

The l+l operation takes O(l) time. We denote c' 2 = (c 2 l±J 1). We show that we 
perform c 1 U c' 2 in O(l). We only consider the case when c 1 and c' 2 are i-zipper, 
c 1 = [a - 6 - c - d],c' 2 = [(a' + 1) - (6' + 1) - (d + 1) - (d' + 1)]. Reasoning 
about other cases is analogous. 

We construct the union in three steps Cf(xi j) = [a* ~ b* — c* ~ d*] = c" 1 U c" 2 U 
c" 3 , where c" 1 = [a - 6] U [(a' + 1) - (5' + 1)], c" 2 = [6, c] U [(6' + 1), (c' + 1)] and 
c" 3 = [c~d]U[(c' + l)~(d' + l)]. 



By Theorem [T] we know that [b, c] n [b', d] ^ 0. Hence, [b, c] and [b' + 1, d + 1] 
intersect or touch so that c = 6' — 1 if b < 6'. In any case c" 2 = [6, c] U [(6' + 1), (d + 
1)] = [6*, c*] is an interval, where b* = min(6, (b' + l)) and c* = max(c, (c'+l)). Next 
we consider the / • zip parts of c 1 and c' 2 , [a ~ 6] and [(a' + 1) ~ (6' + 1)], respectively. 
It suffices to restrict these sets to [a ~ b*] and [a' ~ 6*] as 6* = min(6, (6' + 1)). 
Note that c" 1 = [a ~ 6*] U [(a' + 1) ~ b*] can be either an interval or a zipper and 
an i zipper of the form [r ~ g — 6*] depending on the parities of a and a'. In any case, 
by Theorem [T[ b* — a < 4, b* — (a' + 1) < 4, hence there is a bounded number of 
outcomes and we compute the union in constant time. Similarly, we compute c" 3 which 
is the union of [c* ~ <f] and [c* ~ (d' + 1)]. Hence, we can compute the union c 1 U c' 2 
in O(l) time. 

Complexity analysis . For each forward cost set Cf(xij), j G D(Xi),i £ [1, . . . , n] can 
be computed in O(d). As we have 0(nd) forward cost sets, the total time complexity 
is 0(nd 2 ). Note that one way to reduce this complexity is to compute Cf(xij) in O(l). 

□ 



